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for power spectral density functions 1 

Tryphon T. Georgiou^ 

o 

O ' Abstract 

<N 

bJQ| We present an intrinsic metric that quantifies distances between power spectral density functions. The metric 

. was derived by the author in [3] as the geodesic distance between spectral density functions with respect to a 

■ particular pseudo-Riemannian metric motivated by a quadratic prediction problem. We provide an independent 

verification of the metric inequality and discuss certain key properties of the induced topology. 
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I. The metric property 



;> r ■ 1 HE present work builds on a recent report [3] where the present author introduced a natural 

*so : 

00 ■ _M_ pseudo-Riemannian metric on power spectral density functions of discrete-time stochastic processes, 

00 ' characterized geodesies, and computed geodesic distances. The geodesic distance between two power 

O ! 

^ ■ spectral density functions /;(#), with i = 1, 2 and 9 G [— 7r, it], was shown to be 




I Below we will provide a direct verification that d g (-, ■) provides a pseudo-metric on the cone of power 
?h spectral density functions 

V:={f : f(9)>0for9e[-n,n], f eL.i-n^]}. 

(As usual, Li,L 2 denote Lebeague spaces of integrable and square-integrable functions, respectively.) 

The only reason d g (-, •) is a pseudo-metric and not a metric is because it is insensitive to scaling, i.e., 
d g (fi, f 2 ) = d g (fi, ft/2) for any k > 0. Thus, it does not differentiate between spectral densities which 
only differ by a constant nonzero positive factor. Families of spectral density functions related in this way 
are referred to as spectral rays and form a set 

TZ := {/modM+ : f(0) > for 6 G [-7T,7r], / G Li[-7T,7r]} 
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research has been supported by the NSF and the AFOSR. 
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of equivalence classes, and d g (-, •) can be used to evaluate distances on 1Z via comparing any two 
representatives on any two given spectral rays. Then, as we will see, d g (-, •) defines a metric on 1Z. This 
metric can be also be readily modified to provide a metric on V if for instance, \f* (fi(0) — h{6)) y~\, 
or the absolute difference of any other generalized means is added on as in 

d6, 



d(fij 2 ) := d g (f u f 2 ) + \ (fa(6) - f 2 (6)) 

J-K 2?T 

to differentiate the effect of scaling. 

Before we proceed, we clarify how to evaluate d g (-,-) on all spectra in V, including those that may 
vanish on a subset of the frequency interval [— 7r,7r] rendering \og(fa/ f 2 ) 2 non-integrable. Clearly, when 
neither argument of d g (fi, f 2 ) vanishes and fa(9) stays away from zero for 9 £ [— rr, ix] and i = 1,2, then 
log/j £ L 2 \— n, 7r] and d g (fi, f 2 ) is well defined and finite. But, if either fa (i = 1, 2) vanishes on [— 7r, it] 
the integrals may diverge. However, since the root-mean-square of any function, and hence of log(/i// 2 ) 
in particular, is always greater than or equal to its arithmetic mean (e.g., see [2]) it follows that 




Therefore, O gives either a nonnegative real or has to be assigned the value +00. In conclusion, we 
complete the definition of d g (-, •) as follows: if 

logfy e L 2 [-7r,7r], (3) 
h 

in which case the left hand side of © is finite, d 9 (fi, f 2 ) is evaluated using (QJ. If however © fails then, 
for consistency with ([T]), we assign 

d a (/i,/ 2 ):=oo. (4) 

Clearly, failure of © can always be traced to at least one of fa (i £ {1,2}) failing to satisfy log/j £ 
L 2 [-7r,7r] (otherwise, necessarily, \og(f 1 /f 2 ) = (log/i - log/ 2 ) £ L 2 [-7r,7r]). 

Theorem 1: d g (-, •) defines a pseudo-metric on I? and a metric on TZ. 

Proof: By definition •) £ [0, 00) U {+00}. It is also easy to observe that 

d g tfi,h)=d g {h,h). (5) 

To see this note that log(/i// 2 ) = — log(/ 2 //i) and that (UJ) is impervious to a sign change in front of 
the logarithms. Also in case one of log(/i// 2 ), log(/ 2 //i) fails to be in L 2 , so does the other, and again 
dg{fi, f2) = d g (f 2 , fa) (both being 00). Thus, © holds. 
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When d g (fi, f 2 ) = the root-mean-square of the function log(/i// 2 ) is equal to its arithmetic mean, 
and this only happens (see [2]) when the function is constant, i.e., 

<Wi,/ 2 ) = => log^ = ceR for all 9 e [-n,n] 
h = «/2 

/xmodM = / 2 modK, 

since k = e c is a constant. Thus, d g (-, •) separates the elements of 1Z. 

We finally establish the triangular inequality. So let us consider fa e V for i E {1, 2, 3}. We will show 
that 

dg(fuf2) + d g {f 2 ,h)>d g (f 1 ,f z ). (6) 

We first argue the case when d g (f 1 , f 3 ) = oo. It suffices to show that one of the left hand side terms is 
also infinity. Assume the contrary, i.e., that 

h h 
log — e L 2 [-n,n] as well log — G L 2 [-n,n]. 

h J3 
It readily follows that log(/i// 3 ) = log(/i// 2 ) + ^og(f 2 / f 3 ) £ £ 2 [— 7r] which contradicts the assumption 
that d g (fi, fs) = oo. Thus, at least one of d g (fx,f 2 ), d g (f 2 , / 3 ) is infinity and the triangular inequality 
holds. Of course, if log(/i// 3 ) is finite and any of d 9 (/i, / 2 ), d g (f 2 , f 3 ) takes the value oo, the triangular 
inequality holds anyway. 

We now argue the case when all three d g (fi,f 2 ), d g (f 2 , / 3 ) and d g (fi, f 3 ) are finite. To this end we 
square both sides of © and utilize 



, fl , /l . i /2 

log 7- = !°g 7- + log—, 

J3 J2 J3 

to simplify the resulting expression and deduce the following inequality 



(7) 



fiVde ( r\ hde\ 2 r / ; 2 \ 2 ^ /r / 2 ^\ 2 



> , |log A log Mf_f/ log 4fU/ log ^v (8) 



Thus the two inequalities © and © are equivalent to one another, and therefore, in order to ascertain 
© it suffices to establish the validity of ®. 

To this end, let a := log(/i// 2 ), /5 := log(/ 2 // 3 ) and rewrite (El) in the form 



2vr 2tt' 2vr 2vr 

, do ( r dd\ ( r de\ 
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Since © is homogeneous in both a and (3, scaling of either leaves it unaffected. Therefore, if 




Gp = 

and a := ^-a, b := -±-(3, the inequality © is equivalent to 



f w , n dO ( r dd\ ( r i dO\ 
1 > J (ab) a-) [ I b-) 



with the side conditions 



2vr J \J_„ 2vr ) 



^dd_rdd = and 

2lT ]_„ 111 



2tt V_ w 2tt 

But the validity of ( fTTjt follows trivially from the standard inequality 

dO ( r , i,d6\ 2 



pM_,r b M ) 2 = L (12) 



after we expand the squares on both sides and use (fTTT) and (fT2l) to simplify the resulting expressions. 
Thus 4T0j> with (I11H12I) holds =4> © dHJ) ©, and this completes the proof. ■ 
Remark: The definition of d g (-, •) distinguishes two classes of power spectral densities according to 
whether their logarithm is square integrable or not. The first class, ©interior : = {/ £ © : log/ £ 
L 2 [— 7r, 7r]}, can be thought of as "interior" points lying to within a finite distance from one another, and to 
within a finite distance from constant non-zero power spectral densities. The second class, with logarithms 
that fail to be square integrable, contains power spectral densities which lie at an infinite distance from any 
density in ©interior- On the other hand, power spectal densities are traditionally differentiated according 
to whether the underlying process is deterministic or not. More specifically, a stochastic process is said 
to be non-deterministic (in the sense of Kolmogoroff) if the variance of the one-step-ahead prediction 
error cannot be made arbitrarily small. In turn, this property is characterized by the log-integrability of 
the corresponding power spectral density function (see [4], [5]), i.e., ©non-deterministic :={/£© : 
log/ £ Li[— vr,7r]}. Thus, it is interesting to observe that ©interior C ©non-deterministic and hence, finite 
neighborhoods of elements in ©interior contain non-deterministic power spectra only. 

II. RlEMANNIAN GEOMETRY, GEODESICS, AND INTRINSIC METRICS 

We now explain the geometric significance of d g (-, •) recapitulating some of the development in [3]. The 
starting point that led to (HJ) is a prediction problem and the degradation of the variance of the prediction 
error when the design of the predictor is based on the wrong choice among two alternatives. More 
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specifically, let fi, f 2 represent spectral density functions of discrete-time zero-mean stochastic processes 
Uf % (k) (i G {1,2} and k G Z), and let Pf^i) (I G {1,2,3, . . .}) represent values for the coefficients that 
minimize the linear prediction error variance 

oo 

s{\u f M-j2p(^fA-m- 

Thus, the optimal set of coefficients depends on the power spectral density function of the process, a 
fact which is duly acknowledged by the subscript in the notation Pfi(£). Here, as usual, £{ } denotes 
the expectation operator. It is reasonable to consider as a distance between / x and f 2 the degradation 
of predictive error variance when the coefficients p(£) are selected assuming one of the two, and then 
used to predict a stochastic process corresponding to the other spectral density function. The ratio of the 
"degraded" predictive error variance over the optimal error variance 

n(f n _ ^fM-ET=iPh(^ h (-£)\ 2 } 
PUuj2) ■ £{u h (o)-E7=iP.fMu h (-m 

turns out to be equal to the ratio of the arithmetic over the geometric means of the fraction of the two 
spectral density functions, namely 



P(fi, h 



7T h{9) M 

He) 2tt 



/7T 
— TV 



exp(.r:>g(£g)g 



r i £ £ \ , f rfi(9)d9\ r f fl(9)\ 



see [3]. 

The logarithm logp(/i,/ 2 ) =: S 3i / g (fi, f 2 ) (where the subscript a/g signifies arithmetic/geometric) 

represents a measure of dissimilarity between the "shapes" of fi and f 2 and, can be viewed, as analogous 

to "divergences" of Information Theory. Indeed, 

n f 1 (9)d9\ f\„Jfi(9)\ d6_ 

2tt 

vanishes only when fi/ f 2 is constant on [— 7r,7r] and is positive otherwise. Considering the distance 
^a/g(/; / + A) between a nominal power spectral density / and a perturbations / + A, and eliminating 
cubic terms and beyond, leads (modulo a scaling factor of 2) to the Riemannian pseudo-metric in Pmtcrior 
which is given by the following quadratic differential form 

9I(A) := LAW) ) r, - [Lm t.) ■ (13) 

Interestingly, geodesic paths f T (r G [0, 1]) connecting spectral densities / , fi an d having minimal length 



V 7 ^^ \/S a / s (fr, fr+dr) = J ^Qfri^W 



can be explicitely computed [3]. They turn out to be logarithmic intervals 

fr(e) = ft T (9)fI(9) for re [0,1], (14) 
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between the two extreme points. Furthermore, the length along such geodesies is precisely d g (f , fi) as 
given in ©. 

The closed form of the geodesic path allows us to verify directly that any two power spectral densities 
fo, fi, at a finite distance from one another, can be connected with a path of the same length. A topological 
space with such a property is said to be a length-space and the metric is said to be intrinsic. The fact 
that d g (-,-) is intrinsic can be readily verified and this is done below. 

Proposition 2: d g (-,-)\s intrinsic on V and TZ. 

Proof: By direct substitution into (UJ) we can verify that for any fo, fi such that d g (fo, fi) < oo, 
any r e [0,1], and with f T defined as in ©Jl, d g (f , f T ) = Td g (f ,fi), d g (f T ,f 1 ) = (1 - r)d fl (/ ,/i), 
and even d g (f T , f T +dr) = d g (f ,fi)dr. It readily follows that the length of the path d g (f T , f T +dr) = 
fo d g (fo, h)dr equals the distance d g (fo, f\) between the end points. ■ 

Remark: Besides <5 a / g (./i, / 2 ), several other "divergences" have been introduced in [3] as appropriate 
distance measures (though not metrics). First the symmetrized version 

S(fi, h) = 5 a / g (/i, h) + S a/S (f 2 , fi), 

and then, 

involving comparison of other generalized means. It is interesting to point out that the quadratic terms of 
$r,s(f, / + 5 a / g (/, / + A) and 5(f, f + A) in the "perturbation" A, all turn out to be identical (modulo 
a scaling). Hence, they all lead to the same Riemannian pseudo-metric (fT3l . 

III. Concluding thoughts 

It is interesting to compare the differential structure on power spectral density functions which we have 
introduced above with the corresponding differential structure of "Information Geometry." In Information 
Geometry f{6) corresponds to a probability density on [— n, n] and the natural Riemannian metric is the 
Fisher information metric is (cf. [1, page 28]) which is this case would be 




(with JZ n f(^)^ = 1 an d f\k{6)^ = since both /, / + A need to be probability densities). Direct 
comparison reveals that the powers of f(9) in ( fT3l and (1131) are different. Thus, it is curious and worth 
underscoring that in either differential structure, geodesies and geodesic lengths can be computed. 
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